Nested Satisfiability 

by Donald E. Knutb* 



Abstract. A special case of the satisfiability problem, in which the clauses have 
a hierarchical structure, is shown to be solvable in linear time, assuming that the 
clauses have been represented in a convenient way. 



Let X be a finite alphabet linearly ordered by < ; we will think of the elements of X 
as boolean variables. As usual, we define the literals over X to be elements of the form x 
or x, where x G X. Literals that belong to X are called positive; the others are called 
negative. 

The linear ordering of X can be extended to a linear preordering of all its literals in 
a natural way if we simply disregard the signs. For example, if X = {a, 6, c} has the usual 
alphabetic order, we have 

a = a < b = b < c=c. 

If a and r are literals, we write a < r if a < r or a = r; this holds if and only if the 
relation a > r is false. 

A clause over X is a set of literals on distinct variables. Thus, the literals of a clause 
can be written in increasing order, 

<Ti < cr 2 < • ■ • < o k . 

A set C of clauses over X is satisfiable if there exists a clause over X that has a nonempty 
intersection with every clause in C. For example, the clauses 

{a,b,c} {a, c} {a,b,c} {a, c} {a,b} 

over {a, b, c} are satisfiable uniquely by the clause {a, b,c}. 

We say that clause C straddles clause C if there are literals a, r in C and £' in C such 

that 

a <£' <t . 

Two clauses overlap if they straddle each other. For example, {a,b, c} and {a, b, c} overlap; 
but the other nine pairs of clauses in the example above are non-overlapping. Clauses on 
two elements each, like {a, c} and {b, d}, can also be overlapping. A set of clauses in which 
no two overlap is called nested. 

The general problem of deciding whether a given set of clauses is satisfiable is well 
known to be NP-complete. But we will see that the analogous question for nested clauses is 
efficiently decidable. The main reason for interest in nested clauses is David Lichtenstein's 
theorem of planar satisfiability, which can be restated in algebraic terms as follows: The 
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joint satisfiability problem for two sets C,C of nested clauses is NP-complete. In fact, 
Lichtenstein proved [1, p. 339] that this problem is NP-complete even if all clauses of C 
contain only positive literals and all clauses of C contain only negative literals, with at 
most three literals per clause. 

1. Structure of nested clauses. A clause over an ordered alphabet has a least literal a 
and a greatest literal r. Any variable that lies strictly between a and r is said to be interior 
to that clause. A variable can occur as an interior literal at most once in a set of nested 
clauses; for if it is an interior literal in two different clauses, those clauses overlap. Hence, 
the total number of elements among m nested clauses on n variables is at most 2m + n. 

Let us write C y C if C straddles C but C does not straddle C. This relation is 
transitive. For if C y C and C straddles C" , we have literals 



in appropriate clauses; and we must have a < a' and r' < r, or else C would straddle C. 
Hence C straddles C" . Similarly if C y C" and C" straddles C, then C straddles C. 
Therefore CyC'y C" implies that C y C" . 

In a set of nested clauses, we have C y C if and only if C straddles C . The transitivity 
of this relation implies that we can topologically sort any set of nested clauses into a linear 
arrangement in which each clause appears after every clause it straddles. When such an 
arrangement is given, and when the elements of each clause are presented in order, we 
will show that satisfiability can be decided in 0(m + n) steps on a RAM, where m is the 
number of clauses and n is the number of variables. 

(Incidentally, a set of nested clauses can be shown to have a tree-like structure, al- 
though we do not need this characterization in the algorithm. Let us write C < C if a < r' 
for all a E C and r' G C. If neither C nor C straddles the other, it is easy to see that 
we must have either C < C or C < C, unless C and C are both clauses on the same two 
literals. Suppose we call such 2-element clauses equivalent. Then a nested set of clauses 
will satisfy the condition 



because we cannot have C >~ C" and C >- C" when C < C. This means that >- is the 
ancestor relation in a hierarchy.) 

2. An algorithm. Let us assume that the alphabet X is represented as the positive 
integers {1,2,..., n}, with x = —x. The clauses will be specified in two arrays 



a' < < r' 



(C y C" and C y C") implies (C y C or C 



C or C'yC), 



lit[l . . 2m + n] and start[l . . m + 1] 



where the literals of clause i are 



lit[j] , 



for start[i] < j < start[i + 1] 
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in increasing order as j increases. The clauses are assumed to be arranged so that clause 
i does not straddle clause i' when % <%' . We can safely assume that all clauses contain at 
least two literals. 

The key idea of the algorithm below is that the interior variables of a clause are not 
present in subsequent clauses. Therefore we only need to remember information about the 
dynamically changing set of all variables 

1 = x\ < x 2 < ■ ■ ■ < Xk = n 

that have not yet appeared as interior variables. Initially k = n. 

The set of all clauses seen so far, as the algorithm proceeds to consider the clauses in 
turn, can be conceptually partitioned into intervals 

[x 1 ..X 2 ], [x 2 ..X 3 ], [Xk-I.-Xk], 

such that all literals of each previously processed clause belong to one of these intervals. 
The current intervals are maintained in an array 

next[l . . n] 

where next[xj] = Xj + \ for 1 < j < k. 

The only slightly complex data structure in the algorithm below is the array 

sat[l . .n, boolean, boolean] 

which has the following interpretation: If [xj . . Xj+i] is an interval of the current partition, 
then sat[xj, s,t] will be either or 1 for each pair s,t £ {false, true}. It is 1 if and only 
if the clauses already processed, belonging to the interval [ Xj . . Xj -|-l] ; 3X6 satisfiable by 
clauses in which the least and greatest literals are respectively Xj\s and Xj+i\t, where 

■ J —x , s = false; 
\ +x , s = true. 

For example, suppose we have seen only one clause, {1,-2}. Then we will have 
sat[l, false, true] = ; 

sat[l, false, false] = sat[l, true, false] = sat[l, true, true] = 1 . 

It turns out that the sat array contains all the information necessary to continue processing, 
because literals that have appeared as interior variables will not be present in subsequent 
clauses. 

The algorithm's main task is to maintain the sat array as it examines a new clause 
Ci = Wi, . . . ,o~ q }. The variables |<Ti| < • ■ ■ < \a q \ will be a subset of the current partition 
variables x\, . . . ,Xk- All of the current partition variables between |a"i| and \a q \, whether 
they appear in the new clause or not, are interior to the clause, so they will be removed. 
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Suppose 1 0i | 



x 



,. The algorithm proceeds by letting a variable x run through the val- 



|, maintaining information needed to update the values of sat[x p , s,t] 
when the interior variables of Cj are eliminated from the partition. Let Ci(x) be the literals 
of Ci that are strictly less than x, and let C(x) be the clauses preceding Ci whose literals 
are confined to the interval [x p . . x] . The updating process is carried out by computing 
auxiliary values newsat x [s,t] defined as follows: 

{0 , if C(x) is not satisfiable(s, t); 
1 , if C(x) is satisfiable(s, t) but C(x) U {Ci(x)} isn't; 
2, if C(x) U {Ci(x)} is satisfiable(s,£). 

Here 'satisfiable(s, t)' means there is a clause containing x p \s and x\t that has a nonempty 
intersection with each clause of the given set of clauses. 

For example, suppose Ci = {—1,2,4} and {x\, X2, #3, x±} = {1,2,3,4}, and suppose 
that the clauses C±, . . . , Ci-i have led to the following values: 

sat[l,s,t] sat[2,s,t] sat[3,s,t] 



false 
false 
true 
true 



t 

false 
true 
false 
true 




1 
1 
1 




1 
1 
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Then we have 
s t 

false 
true 
false 
true 



false 
false 
true 
true 



newsati 

1 


1 



newsat2[s, t] newsat^[s, t] newsat^s^t] 

2 
2 

1 2 
1 1 1 

and we will want to update the arrays by setting nextfl] <— 4 and 

sat[l, false, false] <— 0: 
sat[l, false, true] <— 
sat[l, true, false] <— 
sat[l, true, true] <— 1 . 

If Ci were { — 1,2, —4} instead of { — 1, 2, 4}, the computation of newsat would be the same, 
but the values of sat[l, s,t] would all become 0; the clauses would be unsatisfiable, since 
news at 4[true, true] is only 1, not 2. (The reader is encouraged to study this example 
carefully, because it reveals the key principles underlying the algorithm.) 

3. Programming details. It is convenient to assume that an artificial (m + l)st clause 
with the dummy variables {0, n + 1} has been added after C m . Therefore we will declare 
slightly larger arrays than stated earlier: 

start[l . . m + 2] ; next[0 . . n] ; sat[0 . . n, boolean, boolean] . 

There are two auxiliary arrays new sat[boolean, boolean] and tmp[boolean, boolean]. We can 
now decide the nested satisfiability problem as follows. 
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for x <— to n do next[x] <— x + 1; 
for £ <— to n do 

for s <— false to true do for t <— false to true do sa£[x, s, t] <— 1; 
for i <- 1 to m + 1 do 

begin / <— a&s [start [z]]); r = abs(lit[start[i + 1] — 1]); 
(Compute the newsat table); 
next[/] <— r; 

for s <— false to true do for t = false to true do 
sat[Z,s,t] <— newsat [s, t] div 2; 

end; 

if sat[0, true, true] = 1 then print ( 'Satisf iable ') else print ( 'Unsatisf iable '). 

The example in the previous section illustrates how the newsat table can be computed 
in general. We run the process slightly longer so that a good newsat value will be 2 (not 1) 
at the end. (The value of a q must be examined.) 

(Compute the newsat table) = 

j <— start[i\; sig <— lit[j]; x <— abs(sig); 
newsat[false, false] <— 1; newsat[true, true] <— 1; 
newsat[false, true] <— 0; news at[true, false] <— 0; 
while true do 

begin if x = abs(sig) then 

begin (Upgrade a newsat from 1 to 2, if possible); 

3^3 + 1; sig <- lit[j]; 

if j = start[i + 1] then goto done; 

end; 

(Modify newsat for the next x value); 
x <— next[x]; 
end; 
done: 

(Upgrade a newsat from 1 to 2, if possible) = 
t<-(x = sig); 
for s ^— /a/se to true do 

if new5at[s, t] = 1 then newsat[s,t] <— 2. 

(Modify newsat for the next x value) = 

for s <— false to true do for t <— /ake to true do 
tmp[s,t] <— max(newsat[s, false] * sat[x, false, t], 
news at[s, true] * sat[x, true, t]); 
for s <— false to true do for t <— false to true do 
newsat [s, t] <— tmp[s,t]. 

The running time is 0(m + n), because each value of x is either first or last in the 
current clause (accounting for 2(m + l) cases) or it is being permanently removed from the 
partition (in exactly n cases, because of the dummy clause {0, n + 1} at the end). 
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We have not considered here the time that might be required to test if a given satis- 
fiability problem is, in fact, nested under some ordering of its variables. 

Concluding remarks. This algorithm for nested satisfiability works by essentially replac- 
ing each clause by a clause containing only two literals, using a special form of "dynamic 
2 SAT" to justify the replacement. However, the instances of 2 SAT that arise are not 
completely general. This suggests that a somewhat larger special case of the satisfiability 
problem might be solvable in linear time by similar techniques. 
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